Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Composition of a Dewarped and Enhanced Document Image From Two View Images

Identifieur interne : 000A88 ( Main/Exploration ); précédent : 000A87; suivant : 000A89

Composition of a Dewarped and Enhanced Document Image From Two View Images

Auteurs : HYUNG IL KOO [Corée du Sud] ; Jinho Kim [Corée du Sud] ; NAM IK CHO [Corée du Sud]

Source :

RBID : Pascal:09-0350330

Descripteurs français

English descriptors

Abstract

In this paper, we propose an algorithm to compose a geometrically dewarped and visually enhanced image from two document images taken by a digital camera at different angles. Unlike the conventional works that require special equipments or assumptions on the contents of books or complicated image acquisition steps, we estimate the unfolded book or document surface from the corresponding points between two images. For this purpose, the surface and camera matrices are estimated using structure reconstruction, 3-D projection analysis, and random sample consensus-based curve fitting with the cylindrical surface model. Because we do not need any assumption on the contents of books, the proposed method can be applied not only to optical character recognition (OCR), but also to the high-quality digitization of pictures in documents. In addition to the dewarping for a structurally better image, image mosaic is also performed for further improving the visual quality. By finding better parts of images (with less out of focus blur and/or without specular reflections) from either of views, we compose a better image by stitching and blending them. These processes are formulated as energy minimization problems that can be solved using a graph cut method. Experiments on many kinds of book or document images show that the proposed algorithm robustly works and yields visually pleasing results. Also, the OCR rate of the resulting image is comparable to that of document images from a flatbed scanner.


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">Composition of a Dewarped and Enhanced Document Image From Two View Images</title>
<author>
<name sortKey="Hyung Il Koo" sort="Hyung Il Koo" uniqKey="Hyung Il Koo" last="Hyung Il Koo">HYUNG IL KOO</name>
<affiliation wicri:level="4">
<inist:fA14 i1="01">
<s1>Department of Electrical Engineering and Computer Science and the INMC, Seoul National University</s1>
<s2>Gwanak-gu, Seoul, 151-744</s2>
<s3>KOR</s3>
<sZ>1 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Corée du Sud</country>
<placeName>
<settlement type="city">Séoul</settlement>
</placeName>
<orgName type="university">Université nationale de Séoul</orgName>
</affiliation>
</author>
<author>
<name sortKey="Kim, Jinho" sort="Kim, Jinho" uniqKey="Kim J" first="Jinho" last="Kim">Jinho Kim</name>
<affiliation wicri:level="1">
<inist:fA14 i1="02">
<s1>Multimedia Laboratory, Telecommunication R&D Center, Samsung Electronics Company, Ltd</s1>
<s2>Suwon, Gyeonggi-do, 443-742</s2>
<s3>KOR</s3>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>Corée du Sud</country>
<wicri:noRegion>Suwon, Gyeonggi-do, 443-742</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Nam Ik Cho" sort="Nam Ik Cho" uniqKey="Nam Ik Cho" last="Nam Ik Cho">NAM IK CHO</name>
<affiliation wicri:level="4">
<inist:fA14 i1="01">
<s1>Department of Electrical Engineering and Computer Science and the INMC, Seoul National University</s1>
<s2>Gwanak-gu, Seoul, 151-744</s2>
<s3>KOR</s3>
<sZ>1 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Corée du Sud</country>
<placeName>
<settlement type="city">Séoul</settlement>
</placeName>
<orgName type="university">Université nationale de Séoul</orgName>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">09-0350330</idno>
<date when="2009">2009</date>
<idno type="stanalyst">PASCAL 09-0350330 INIST</idno>
<idno type="RBID">Pascal:09-0350330</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000221</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000558</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000204</idno>
<idno type="wicri:doubleKey">1057-7149:2009:Hyung Il Koo:composition:of:a</idno>
<idno type="wicri:Area/Main/Merge">000A98</idno>
<idno type="wicri:Area/Main/Curation">000A88</idno>
<idno type="wicri:Area/Main/Exploration">000A88</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">Composition of a Dewarped and Enhanced Document Image From Two View Images</title>
<author>
<name sortKey="Hyung Il Koo" sort="Hyung Il Koo" uniqKey="Hyung Il Koo" last="Hyung Il Koo">HYUNG IL KOO</name>
<affiliation wicri:level="4">
<inist:fA14 i1="01">
<s1>Department of Electrical Engineering and Computer Science and the INMC, Seoul National University</s1>
<s2>Gwanak-gu, Seoul, 151-744</s2>
<s3>KOR</s3>
<sZ>1 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Corée du Sud</country>
<placeName>
<settlement type="city">Séoul</settlement>
</placeName>
<orgName type="university">Université nationale de Séoul</orgName>
</affiliation>
</author>
<author>
<name sortKey="Kim, Jinho" sort="Kim, Jinho" uniqKey="Kim J" first="Jinho" last="Kim">Jinho Kim</name>
<affiliation wicri:level="1">
<inist:fA14 i1="02">
<s1>Multimedia Laboratory, Telecommunication R&D Center, Samsung Electronics Company, Ltd</s1>
<s2>Suwon, Gyeonggi-do, 443-742</s2>
<s3>KOR</s3>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>Corée du Sud</country>
<wicri:noRegion>Suwon, Gyeonggi-do, 443-742</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Nam Ik Cho" sort="Nam Ik Cho" uniqKey="Nam Ik Cho" last="Nam Ik Cho">NAM IK CHO</name>
<affiliation wicri:level="4">
<inist:fA14 i1="01">
<s1>Department of Electrical Engineering and Computer Science and the INMC, Seoul National University</s1>
<s2>Gwanak-gu, Seoul, 151-744</s2>
<s3>KOR</s3>
<sZ>1 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Corée du Sud</country>
<placeName>
<settlement type="city">Séoul</settlement>
</placeName>
<orgName type="university">Université nationale de Séoul</orgName>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">IEEE transactions on image processing</title>
<title level="j" type="abbreviated">IEEE trans. image process.</title>
<idno type="ISSN">1057-7149</idno>
<imprint>
<date when="2009">2009</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">IEEE transactions on image processing</title>
<title level="j" type="abbreviated">IEEE trans. image process.</title>
<idno type="ISSN">1057-7149</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Algorithm</term>
<term>Blurred image</term>
<term>Curve fitting</term>
<term>Cylindrical shape</term>
<term>Digitizing</term>
<term>Document image processing</term>
<term>Graph cut</term>
<term>Graph method</term>
<term>Image processing</term>
<term>Image quality</term>
<term>Optical character recognition</term>
<term>Pattern recognition</term>
<term>Robust estimation</term>
<term>Specular reflection</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Traitement image document</term>
<term>Algorithme</term>
<term>Ajustement courbe</term>
<term>Forme cylindrique</term>
<term>Reconnaissance optique caractère</term>
<term>Numérisation</term>
<term>Qualité image</term>
<term>Image floue</term>
<term>Réflexion spéculaire</term>
<term>Méthode graphe</term>
<term>Coupe graphe</term>
<term>Estimation robuste</term>
<term>Traitement image</term>
<term>Reconnaissance forme</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr">
<term>Numérisation</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">In this paper, we propose an algorithm to compose a geometrically dewarped and visually enhanced image from two document images taken by a digital camera at different angles. Unlike the conventional works that require special equipments or assumptions on the contents of books or complicated image acquisition steps, we estimate the unfolded book or document surface from the corresponding points between two images. For this purpose, the surface and camera matrices are estimated using structure reconstruction, 3-D projection analysis, and random sample consensus-based curve fitting with the cylindrical surface model. Because we do not need any assumption on the contents of books, the proposed method can be applied not only to optical character recognition (OCR), but also to the high-quality digitization of pictures in documents. In addition to the dewarping for a structurally better image, image mosaic is also performed for further improving the visual quality. By finding better parts of images (with less out of focus blur and/or without specular reflections) from either of views, we compose a better image by stitching and blending them. These processes are formulated as energy minimization problems that can be solved using a graph cut method. Experiments on many kinds of book or document images show that the proposed algorithm robustly works and yields visually pleasing results. Also, the OCR rate of the resulting image is comparable to that of document images from a flatbed scanner.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Corée du Sud</li>
</country>
<settlement>
<li>Séoul</li>
</settlement>
<orgName>
<li>Université nationale de Séoul</li>
</orgName>
</list>
<tree>
<country name="Corée du Sud">
<noRegion>
<name sortKey="Hyung Il Koo" sort="Hyung Il Koo" uniqKey="Hyung Il Koo" last="Hyung Il Koo">HYUNG IL KOO</name>
</noRegion>
<name sortKey="Kim, Jinho" sort="Kim, Jinho" uniqKey="Kim J" first="Jinho" last="Kim">Jinho Kim</name>
<name sortKey="Nam Ik Cho" sort="Nam Ik Cho" uniqKey="Nam Ik Cho" last="Nam Ik Cho">NAM IK CHO</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000A88 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000A88 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Pascal:09-0350330
   |texte=   Composition of a Dewarped and Enhanced Document Image From Two View Images
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024